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Abstract 

In 1964, John Bell proved that quantum mechanics is “unreasonable” (to use Einstein’s term): 
there are nonlocal bipartite quantum correlations. But they are not the most nonlocal bipartite 
correlations consistent with relativistic causality (“no superluminal signalling”): also maximally 
nonlocal “superquantum” (or “PR-box”) correlations are consistent with relativistic causality. I 
show that—unlike quantum correlations—these correlations do not have a classical limit consistent 
with relativistic causality. The generalization of this result to all stronger-than-quantum nonlocal 
correlations is a derivation of Tsirelson’s bound—a theorem of quantum mechanics—from the 
three axioms of relativistic causality, nonlocality, and the existence of a classical limit. But is it 
reasonable to derive (a part of) quantum mechanics from the unreasonable axiom of nonlocality?! 
I consider replacing the nonlocality axiom with an equivalent axiom that even Bell and Einstein 
might have considered reasonable: an axiom of local retrocausality. 


*To appear in Quantum Nonlocality and Reality: 50 Years of Bells theorem , eds. S. Gao and M. Bell 
(Cambridge U. Press), 2015, in press 
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In 1964, John Bell [1] proved that quantum mechanics is “unreasonable”, as defined by 
Einstein, Podolsky and Rosen j2j in 1935: “No reasonable definition of reality could be 
expected to permit this.” “This” (i.e. violation of “Einstein separability” to use a technical 
term, or “spooky action at a distance” as Einstein put it) turns out to be endemic to 
quantum mechanics. For example, pairs of photons measured at spacelike separations may 
yield nonlocal quantum correlations, i.e. correlations that cannot be traced to any data or 
“programs” the photons carry with them. As Bell [3] put it 20 years later, “For me, it is 
so reasonable to assume that the photons in those experiments carry with them programs, 
which have been correlated in advance, telling them how to behave. This is so rational that 
I think that when Einstein saw that, and the others refused to see it, he was the rational 
man. The other people, although history has justified them, were burying their heads in the 
sand. I feel that Einstein’s intellectual superiority over Bohr, in this instance, was enormous; 
a vast gulf between the man who saw clearly what was needed, and the obscurantist. So 
for me, it is a pity that Einstein’s idea doesn’t work. The reasonable thing just doesn’t 
work.” True, history has confirmed nonlocal quantum correlations; but has history passed 
judgment on Einstein and “the others”? Consider what Newton (TJ wrote about his own 
theory of gravity: “That gravity should be innate, inherent and essential to matter so that 
one body may act upon another at a distance through a vacuum without the mediation 
of anything else, by and through which their action or force may be conveyed from one 
to another, is to me so great an absurdity that I believe no man who has in philosophical 
matters any competent faculty of thinking can ever fall into it.” More than two centuries 
years later, Einstein confirmed Newton’s misgivings: gravitational interactions are indeed 
local. Einstein’s theory of gravity is free of the “absurdity” of action at a distance. Well, 
if it took centuries for history to justify Newton’s rejection of action at a distance, couldn’t 
history yet justify Einstein’s rejection of action at a distance? 

This paper offers, not the reasonable thing, but a reasonable thing that just might work. 
Section I reviews the search for simple physical axioms from which to derive quantum me¬ 
chanics. Ideally, such a search could help us understand the theory and make it seem more 
reasonable. But, while we can derive a part of quantum mechanics from three simple phys¬ 
ical axioms, one of the three axioms is the unreasonable axiom of nonlocality! Apparently, 
we are no better off than before. However, Sect. II considers replacing the axiom of non¬ 
locality with an axiom that even Bell and Einstein might have considered reasonable: an 
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axiom of local retrocausality (microscopic time-reversal symmetry). Then Sect. Ill rewrites 
the derivation in Sect. I using the three axioms of causality, local retrocausality, and the 
existence of a classical limit. 

I. NONLOCAL CORRELATIONS IN THE CLASSICAL LIMIT 

It is convenient to discuss Bell’s inequality in the form derived by Clausner, Horne, 
Shimony and Holt [5] (CHSH) for spacelike separated measurements by “Alice” and “Bob” 
on a bipartite system: 


\C(a,b) + C(a,b , ) + C(a\b)-C(a\b')\<2 , (1) 

where a, a', b and b' are observables with eigenvalues in the range [—1,1]; Alice measures a 
or a' in her laboratory, Bob measures b or b’ in his laboratory, and the correlations C(a,b), 
etc., emerge from their measurements. Correlations that violate Eq. (|TJ) by any amount are 
nonlocal. But it is a curious fact, discovered by Tsirelson |6], that the violation of Eq. (|TJ) 
by quantum correlations Cq(a,b), etc., is bounded by 2y/2: 

I C Q (a, b ) + C Q (a, b /) + C Q (a\ b) - C Q (a ', b’)\ < 2^2 , (2) 

even though it is straightforward to define “superquantum” correlations 

C SQ (a, b) = C SQ (a , b') = C SQ (a', b) = 1 = -C SQ (a !, b') (3) 

that violate Eq. ([l]) maximally. A good guess is that superquantum (or “PR-box” 0) 
correlations are too strong to be consistent with relativistic causality, but this guess is easily 
disproved 0 0. Just assume that when Alice measures a or a' she gets ±1 with equal 
probability, and likewise when Bob measures b or b'\ this assumption is consistent with Eq. 
(j3j) and it implies that Alice and Bob cannot signal to each other, since in any case Alice 
and Bob obtain ±1 with equal probability from their measurements. 

Nonlocal quantum correlations are unreasonable, and not even maximally unreasonable! 

But perhaps we should not be so surprised that PR-box correlations are consistent with 
relativistic causality. After all, we have set up quite an artificial comparison. We have not 
compared two theories. We have compared nonlocal quantum correlations belonging to a 
complete theory—quantum mechanics—with ad hoc super-duper nonlocal correlations that 
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do not belong to any theory we know. We are not even comparing apples and oranges. We 
are comparing a serial Nobel prize winner and a lottery winner. Quantum mechanics, as a 
complete theory, is subject to constraints. In particular, quantum mechanics has a classical 
limit. In this limit there are no complementary observables; there are only macroscopic 
observables, all of which are jointly measurable. This classical limit—our direct experience— 
is an inherent constraint, a kind of boundary condition, on quantum mechanics and on any 
generalization of quantum mechanics. Thus stronger-than-quantum correlations, too, must 
have—as a minimal requirement—a classical limit. 

And now the fun begins [9] . Consider the PR box and note that if Alice measures a and 
obtains 1, she can predict with certainty that Bob will obtain 1 whether he measures b or b'; 
if she obtains —1, she can predict with certainty that he will obtain —1 whether he measures 
b or b'. (By contrast, quantum correlations would allow Alice to predict with certainty only 
the result of measuring b or the result of measuring b' but not both pj.) If Alice measures 
a', she can predict with certainty that Bob will obtain her result if he measures b and the 
opposite result if he measures b'. Thus, all that protects relativistic causality is the (tacitly 
assumed) complementarity between b and b': Bob cannot measure both, although—from 
Alice’s point of view—no uncertainty principle governs b and b’. 

Next, suppose that Alice measures a or a' consistently on N pairs. Let us define macro¬ 
scopic observables B and B': 


B 


b\ + &2 T • • • T &/v 
N 


B' 


b\ T b '2 + • • • + b' N 
N 


(4) 


where b m and b' m represent b and b', respectively, on the m-th pair. Alice already knows 
the values of B and B and there must be some measurements that Bob can make to 
obtain partial information about both B and B’\ for, in the classical limit, there can be no 
complementarity between B and B’. Now it is true that a = 1 and a = —1 are equally likely, 
and so the average values of B and B’ vanish, whether Alice measures a or a'. But if she 
measures a on each pair, then typical values of B and B' will be ±1 /a /N (but possibly as 
large as ±1) and correlated. If she measures a 1 on each pair, then typical values of B and B’ 
will be ±1 /\/N (but possibly as large as ±1) and anti-correlated. Thus Alice can signal a 
single bit to Bob by consistently choosing whether to measure a or a 1 . This claim is delicate 
because the large-A r limit in which B and B’ commute is also the limit that suppresses 
the fluctuations of B and B'. We cannot make any assumption about the approach to 
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the classical limit; all that we assume is that it exists, e.g. that the uncertainty product 
ABAB' can be made as small as desired, for large enough N. On the other hand, the axiom 
of relativistic causality cannot grant Bob even the slightest indication about both B and 
B'. Hence all we need is that when Bob detects a correlation, it is more likely that Alice 
measured a than when he detects an anti-correlation. If it were not more likely, it would 
mean that Bob’s measurements yield zero information about B or about B', contradicting 
the fact that there is a classical limit in which B and B' are jointly measurable. 

To ensure that Bob has a good chance of measuring B and B’ accurately enough to 
determine whether they are correlated or anti-correlated, N may have to be large and 
therefore the fluctuations in B and B' will be small. However, Alice and Bob can repeat this 
experiment (on N pairs at a time) as many times as it takes to give Bob a good chance of 
catching and measuring large enough fluctuations. Alice and Bob’s expenses and exertions 
are not our concern. Relativistic causality does not forbid superluminal signalling only when 
it is cheap and reliable. Relativistic causality forbids superluminal signalling altogether. 

For example, let us suppose Bob considers only those sets of N pairs in which B = ±1 
and B’ = ±1. The probability of B — 1 is 2~ N . But if Alice is measuring a consistently, the 
probability of B — 1 and B' — 1 is also 2 -JV , and not 2 ~ 2N , while the probability of B — 1 
and B' = — 1 vanishes. If Alice is measuring a' consistently, the probabilities are reversed. 
(These probabilities must be folded with the scatter in Bob’s measurements, but the scatter 
is independent of what Alice measures.) Thus with unlimited resources, Alice can send a 
(superluminal) signal to Bob. Superquantum (PR-box) correlations are not consistent with 
relativistic causality in the classical limit. 

We have ruled out superquantum correlations ||9j. To derive quantum correlations, how¬ 
ever, we have to rule out all stronger-than-quantum correlations, i.e. we have to derive 
Tsirelson’s bound from the three axioms of nonlocality, relativistic causality, and the exis¬ 
tence of a classical limit. The proof appears elsewhere ra. 

The existence of a classical limit is not the only axiom we can consider adding to the 
axioms of nonlocality and relativistic causality. Alternative axioms m (or a stronger axiom 
of relativistic causality called “information causality” mi have been shown to rule out 
PR-box correlations, and come close to ruling out all stronger-than-quantum correlations. 
However, the physical significance of these axioms requires clarification. Navascues and 
Wunderlich [Q] consider an axiom for a classical limit, but define the classical limit via the 
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“wiring” [13] of entangled systems, and not via complementary measurements that become 
jointly measurable as the number N of systems grows without bound. 


II. LOCAL RETROCAUSALITY AS AN AXIOM 

Bell’s theorem rules out any locally causal account of quantum mechanics. But a number 
of authors, most notably Price [T5] . have suggested a locally causal-retrocausal account of 
quantum mechanics. Here “causality” means “relativistic causality” as before (i.e. no super¬ 
luminal signalling); what is “retro” is that the effect precedes the cause. If the retrocausality 
is local —no action at a distance—then the order of cause and effect is independent of the 
reference frame. Retrocausality is an expression of a fundamental time-reversal symmetry 
in physics. While time-reversal symmetry is not manifest at the macroscopic level—for 
example, a star emits more light than it absorbs—we explain the asymmetry by saying 
that the universe has not reached a state of maximum entropy. At the same time, almost 
all fundamental physical processes at the microscopic level exhibit time-reversal symmetry. 
Aharonov, Bergmann and Lebowitz (ABL) [16j derived an explicitly time-symmetric for¬ 
mula for intermediate quantum probabilities, conditioned on initial and final states; they 
suggested that quantum mechanics has no arrow of time of its own and that time asymmetry 
(e.g. in measurements) originates in macroscopic physics. While the ABL formula is not 
manifestly local, it opens the way to a local account of quantum mechanics via local retro¬ 
causality. Such an account would replace nonlocality with something not only local, but 
even palatable: a fundamental time-reversal symmetry of microscopic causality and retro¬ 
causality. Moreover, if the account includes the quantum correlations that violate Bell’s 
inequality, we can replace the axiom of nonlocality assumed in Sect. I with an axiom of 
local retrocausality, and try to derive quantum mechanics from the three axioms of (rela¬ 
tivistic) causality, local retrocausality, and the existence of a classical limit. Sect. Ill begins 
such a derivation. 

Remarkably, retrocausality is intrinsic to quantum mechanics, as we see if we consider 
three observers, Alice, Bob and Jim, who share an ensemble of triplets of spin-1/2 particles in 
the Greenberger, Horne and Zcilinger (GHZ) pT7] state |GHZ) = (| tAtetj) - 1 3a3b3j))/\/2- 
(See Fig. 1; Alice, Bob and Jim each get one particle in each triplet.) Let Alice and Bob, at 
spacetime points A and B , measure spin components and a ^ ■ n^, respectively, 
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(b) 


(a) 



FIG. 1: Configurations in which Jim can (a) causally and (b) retrocausally put pairs of particles 
shared by Alice and Bob in product or entangled states, as he chooses. The dashed arrows connect 
cause with effect. 

on their particles. For simplicity, let the unit vectors and (which may change from 
particle to particle) lie in the xr/-plane. Let Jim have the special role of the “jammer” 
[IS]; he chooses whether to put the particles held by Alice and Bob in a product state or 
an entangled state. To put them in a product state, Jim (at spacetime point J) measures 
cr[ J \ the ^-component of the spin of his particles. To put them in an entangled state, 
he measures, say, cr^ J \ It doesn’t matter when Jim makes his measurements. In Fig. 
1(a), Jim’s measurements precede Alice’s and Bob’s by a timelike separation; but in Fig. 
1(b), Alice’s and Bob’s measurements precede Jim’s by a timelike separation. Either way, 
Jim cannot send a superluminal signal to Alice and Bob, because his measurements leave 
the pairs held by Alice and Bob in a mixed state—either a mixture of the product states 
I tuts) and | iulu) or a mixture of the entangled states (| tuts) — I \-a\-b)) / y/% and 
(I tuts) + I IaIb))/V2- Without access to the results of Jim’s measurements, Alice and 
Bob cannot distinguish between these two mixtures. But with access to the results, they 
can bin their data accordingly and verify that their results either do or do not violate Bell’s 
inequality in accordance with whether Jim chooses to entangle their pairs or not. 

Thus, Fig. 1(b) nicely illustrates the fact that quantum mechanics is retrocausal—even 
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FIG. 2: (a) Alice and Bob independently measure (prepare) spin states of a pair of converging 
spin-1/2 particles; when the particles meet, Claire’s measurement leaves them in a product state 
or a Bell state, (b) In the reverse sequence, Claire’s measurement prepares a product state or a 
Bell state of the two (diverging) particles, for Alice and Bob to independently measure. 

if we take quantum nonlocality at face value without considering retrocauality. On the one 
hand, there is no reason to doubt that Alice, Bob, and Jim have free will. Indeed, the 
results of Alice and Bob’s measurements are consistent with whatever Jim chooses, right 
up to the moment when he decides to measure or on each of his particles and 
record the results. On the other hand, there is no doubt about the effect (in Jim’s past light 
cone) of Jim’s choice. After Alice and Bob obtain the results of Jim’s measurements (within 
his forward light cone) they can reconstruct from their data whether their particles were 
entangled or not at the time they measured them. Thus quantum mechanics is retrocausal 
(though not necessarily locally retrocausal). 

Now, to illustrate how local retrocausality could obviate nonlocality, consider Fig. 2. 
Figure 2(a) represents what might be called a “reverse EPR experiment”. Alice and Bob, at 
spacetime points A and B, measure spin components -n^ and a (ri> n#, respectively, on 
spin-1/2 particles approaching in pairs, in opposite directions, along the x axis. For each pair, 
Alice and Bob are completely free to choose the unit vectors and independently. The 
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spin states of the particles before they reach Alice and Bob are irrelevant, and the pairs leave 
in products of eigenstates of and ■ n B with eigenvalues ±1. Let |£a£b) denote 

these product states. At the point where the particles meet, Claire is free to make one of two 
measurements. She either measures a nondegenerate operator P with the product eigenstates 
I 1A1 \b)> I 4-aTb)> | Ta|b), and | (he. products of the eigenstates of a^ and cr^), 

or she measures a nondegenerate operator B with the Bell states (| ± I IaIb) /a/2 

and (| TaIs) ± | TaIb) /\/2 as eigenstates. The Bell states are entangled. Let Alice and 
Bob send Claire the results of their measurements. Suppose Claire chooses to measure the 
Bell operator each time; she can bin the data for each pair that arrives in her laboratory 
according to the Bell state that she finds for it. Over time, she will be able to measure the 
quantum correlations between Alice’s and Bob’s measurements from the binned data, for 
each Bell state. These quantum correlations are precisely the nonlocal quantum correlations 
that violate Bell’s inequality, since for quantum probabilities, and hence for correlations, 
time order does not matter: |(£a£b|6,;)| 2 = I (-£*■;|£a£b)| 2 > w h ere 1-6*) is any one of the four 
Bell states. Yet nothing even slightly nonlocal is going on here. The results of Alice’s and 
Bob’s measurements propagate locally and causally to Claire, who “clarifies” the overall 
state of each pair of particles that arrives in her laboratory with her measurement. The 
reason that Fig. 2(a) is locally causal is that local causality brings the results of Alice’s, 
Bob’s and Claire’s measurements all together at the spacetime point C. In Fig. 2(b), local 
causality cannot bring the results of Alice’s, Bob’s and Claire’s measurements all together 
at any point, because the particles in each pair diverge to spacetime points A and B. Thus 
the conditions for Bell’s theorem hold and the quantum correlations of Fig. 2(b), which are 
also the quantum correlations of Fig. 2(a), are nonlocal. 

Nevertheless, time-reversal symmetry suggests that Fig. 2(a) and Fig. 2(b) are analogous. 
Perhaps local retrocausality could play the role in Fig. 2(b) that local causality cannot play: 
local retrocausality could propagate the results of Alice’s and Bob’s measurements at A 
and B , respectively, backwards in time to bring the results of Alice’s, Bob’s and Claire’s 
measurements all together at the spacetime point C. Using the ABL formula, we can 
express the conditional probability that Claire’s measurement at spacetime point C yields 
the Bell state | Bj) as 

nrnhliPA I (gA&I^Afl ~ *c) \Bj) (B,\U(t c - t 0 ) |0) | 2 

p U j)) Xa=i |(^A^s|U(tAs — t c )\Bi){Bi\U(t c — h))|0)| 2 ’ u 
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where |0) is the state of the two spin-1/2 particles at time t 0 , before Claire’s measurement, 
tc is the time of her measurement, and t^B is the time of Alice’s and Bob’s measurements, 
which we can take (for simplicity and without lost of generality) to be simultaneous. The 
unitary operator U{tAB ~ tc) can be rewritten as U\tc — tAB ) to remove any arrow of time 
from the ABL formula: 

Kaa|Ct(t c -t.4B)|Bj)(gj|C(t c -t,)|0)| 2 

here the initial state |0) and final state |£a£b) both evolve locally towards the intermediate 
time of Claire’s measurement. 

The ABL formula realizes the time-reversal symmetry between Fig. 2(a) and Fig. 2(b). 
But we have already noted that time-reversal symmetry holds only for microscopic physics, 
and not for macroscopic physics. In particular, the Born rule belongs to the realm of macro¬ 
scopic physics. In Fig. 2(b), we can use \(£,a£,'b\Bi)\ 2 = |(-B,:|£a£b)| 2 to predict the probability 
of the state |£a£s) given the state | Bf), we cannot use it to retrodict the probability of the 
state | Bi) given the state |£a£#)- More concretely, Claire in Fig. 2(b) could certainly en¬ 
tangle two spin-1/2 particles by measuring on them an operator B with the Bell states 
as eigenstates; but in another experiment to test Bell’s inequality, the particles might be 
photons in a singlet state, produced by the decay of an excited state of an atom. If so, the 
time-reversed experiment—Fig. 2(a) in which the photons converge so precisely as to excite 
an atom—is much less likely. However, the ABL approach [161, US] is still valid: quantum 
mechanics (microscopic physics) contains no arrow of time, and the macroscopic arrow of 
time derives from thermodynamics and boundary conditions on the universe. If so, perhaps 
we can overlook the imperfect analogy between Fig. 2(a) and Fig. 2(b) and let retrocausality 
evolve the states at A and B back to C, where quantum probabilities determine the actual 
sequence of results. This retrocausal description fits naturally with the “two-state-vector” 
formulation of quantum mechanics na om¬ 
it is also consistent with free will, in the following sense. There would be a problem 
regarding free will if, say, Alice could obtain any information about what she measured before 
the measurement. Any physical theory that allowed such a causal loop would be inconsistent. 
But suppose Alice could not obtain any such information before the measurement, but 
someone else could. No causal loop could arise, but would we still say that Alice has free 
will? The question does not apply to Fig. 2(b) because no one has access to information 
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about Alice’s measurement before the event A: a normal (“strong”) measurement between 
A and C would eliminate the causal/retrocausal connection between the two events, and a 
“weak” measurement |21| could yield a result only after Alice’s measurement. 


III. PR-BOX CORRELATIONS FROM LOCAL RETROCAUSALITY 

We can now define a toy model for the PR box as a retrocausal box rather than a nonlocal 
box (as Argaman iza defined a toy model for bipartite singlet correlations). Returning to 
Fig. 2(b), we let Alice’s and Bob’s choices of what to measure (a or a' and b or b ', respectively) 
propagate retrocausally to C, where (for the PR box) choices (a, b), ( a,b') and (a',b) yield 
values (1,1) or (—1, —1) with equal probability, while choice ( a',b') yields values (1, —1) or 
(—1,1) with equal probability. (By analogy with the previous section, we could let Claire 
clarify if the box is a PR box or a different but equivalent box.) Then Alice and Bob’s 
measured correlations are PR-box correlations, i.e. the retrocausal box is equivalent to the 
nonlocal box. And now the conclusions of Sect. I apply to the retrocausal box just as they 
apply to the nonlocal box: Alice and Bob can violate the axiom of no superluminal signalling 
in the classical limit (the limit of arbitrarily many boxes). In other words, the PR-box is 
not causal in the classical limit. Just as in Sect. I, we can eliminate PR-box correlations 
as not satisfying the three axioms of causality, local retrocausality and the existence of a 
classical limit. Likewise, from these three axioms alone we can expect to derive Tsirelson’s 
bound—a theorem of quantum mechanics. 

To conclude, local retrocausality offers us an alternative to “spooky action at a distance”. 
Would Einstein have accepted it? Is local retrocausality a deep principle worthy of being 
an axiom? It is appropriate to let Bell have the last word [23]: 

“I think Einstein thought that Bohm’s model was too glib—too simple. I think he was 
looking for a much more profound rediscovery of quantum phenomena. The idea that you 
could just add a few variables and the whole thing [quantum mechanics] would remain 
unchanged apart from the interpretation, which was a kind of trivial addition to ordinary 
quantum mechanics, must have been a disappointment to him. I can understand that—to 
see that that is all you need to do to make a hidden-variable theory. I am sure that Einstein, 
and most other people, would have liked to have seen some big principle emerging, like the 
principle of relativity, or the principle of the conservation of energy. In Bohm’s model one 
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did not see anything like that.” 
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